THE EFFICIENCY OF THE SAMPLE MEDIAN
by Harold Hotelling

Institute of Statistics, University of North Carolina

A random sample of 2n + 1 is drawn from a univariate distribution concerning
which nothing is known excepting that its variance is finite. What, if any, least
upper bound exists for the ratio of the mean square error in the sample median,
regarded as an estimate of the population median, to the mean square deviation in
the population from the population median? We answer: This ratio can be unity but
no more. For symmetric populations this means that the sampling variance of the
median is less than or equal to the variance of the population. If the population
variance is infinite, that of the sample median may be finite or infinite. Thus for
the Cauchy distribution the variance of the sample median is infinite for samples of
three but finite for larger samples. Our theorem deals only with populations of
finite variance.
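
The remark about the Cauchy distribution can be illustrated numerically. The sketch below is not part of the original argument: it assumes the standard Cauchy population, for which F(x) = 1/2 + arctan(x)/pi, and the usual density C F^n (1-F)^n dF for the median of 2n + 1 observations. The function name, the cutoffs and the step count are hypothetical choices. It integrates m^2 over the range |m| <= T only, so that one can watch whether the value settles as T grows.

```python
import math

def truncated_median_second_moment(n, T, steps=100_000):
    """Midpoint-rule value of E[m^2 ; |m| <= T] for the median m of 2n+1
    standard Cauchy observations (assumed population, not from the text).

    With x = tan(theta), F = 1/2 + theta/pi and dF = d(theta)/pi, so the
    element C * F^n * (1-F)^n dF becomes
    C * (1/4 - theta^2/pi^2)^n d(theta)/pi.
    """
    c = math.factorial(2 * n + 1) / math.factorial(n) ** 2
    a = math.atan(T)              # truncation point on the theta scale
    h = 2 * a / steps
    total = 0.0
    for k in range(steps):
        theta = -a + (k + 0.5) * h
        weight = c * (0.25 - (theta / math.pi) ** 2) ** n / math.pi
        total += math.tan(theta) ** 2 * weight * h
    return total

cutoffs = [10.0, 100.0, 1000.0]
samples_of_three = [truncated_median_second_moment(1, T) for T in cutoffs]
samples_of_five = [truncated_median_second_moment(2, T) for T in cutoffs]
for T, v3, v5 in zip(cutoffs, samples_of_three, samples_of_five):
    print(f"T={T:>7.0f}  size 3: {v3:8.4f}   size 5: {v5:8.4f}")
```

For samples of three the truncated value keeps growing by a roughly constant amount each time T is multiplied by ten, which is the signature of a logarithmically divergent integral; for samples of five the successive values settle down.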

If the population has only two values, each with probability 1/2, the distribution
of the median of any random sample of odd size is exactly the same, and its
variance therefore the same, as for a single observation from the same population.
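
A short exact check of this two-point case, offered as a sketch: the median of 2n + 1 observations equals the larger value exactly when at least n + 1 of the observations do, so its probability is a binomial tail. The function name and the parameter p are hypothetical; p = 1/2 is the case discussed above.

```python
from fractions import Fraction
from math import comb

def prob_median_is_upper(n, p=Fraction(1, 2)):
    """P(sample median equals the larger value) for a sample of 2n+1 drawn
    from a two-point population whose larger value has probability p.

    The median equals the larger value exactly when at least n+1 of the
    2n+1 observations do.
    """
    size = 2 * n + 1
    return sum(comb(size, k) * p**k * (1 - p) ** (size - k)
               for k in range(n + 1, size + 1))

for n in (1, 2, 5, 25):
    print(2 * n + 1, prob_median_is_upper(n))
```

With p = 1/2 the tail probability is 1/2 for every sample size, so the median is distributed exactly like a single observation. With any other p the median does concentrate on the more probable value as the sample grows.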

We shall show that this is the extreme case. But the existence of this case, in
which the median fails to gain in accuracy as the sample size increases, is a warning
against the use of the median in the absence of any knowledge of the form of the
basic distribution, and eliminates an argument occasionally given for using the
median as a statistic of location rather than the arithmetic mean. It is true that
the arithmetic mean also fails for certain distributions, typified by that of Cauchy,
to gain in accuracy with the size of the sample; but the Cauchy distribution with
its infinite high tails is much less like those to be expected in the run of
applications than is the symmetrical two-point distribution, which at least has the
realistic property of a finite range.

Adverse reflections on the use of the median result not only from this remark
that it is sometimes no more accurate than a single observation taken at random, but
also from recent work on order statistics. Thus Smith and Jones, whose work
was further extended by Siddiqui, find that in samples from the Laplace double
exponential distribution, for which the median is the maximum likelihood estimate of
the location parameter $\mu$, certain linear functions of ordered observations have
smaller variances and are unbiased. For random samples of three,
$x_1 \le x_2 \le x_3$, they obtain as the unbiased linear estimate of minimum variance

$$(4x_1 + 19x_2 + 4x_3)/27.$$

I calculate from their covariances among the ordered observations that this linear
function has variance .5895, whereas the median has variance .6389. This is a
notable case of the maximum likelihood estimate failing to give minimum variance
among unbiased estimates of the same parameter. However, the advantage of about
eight per cent for samples of three fades to zero with increase in size of the
sample, while the linear estimates have increasingly complicated sets of coefficients;
so for samples of sizes above the smallest it appears reasonable to continue to use
the median.
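
The comparison for samples of three can be reproduced approximately by simulation. The sketch below rests on two assumptions that are not spelled out in the text: the Laplace density is taken with unit scale, (1/2) exp(-|x|), centred at zero, and the linear estimate is read as having coefficients 4, 19, 4 over 27 (the symmetric choice that sums to the divisor 27, as unbiasedness requires). The function names, trial count and seed are hypothetical.

```python
import random

def laplace(rng):
    """One draw from the assumed density (1/2) * exp(-|x|), centred at 0."""
    magnitude = rng.expovariate(1.0)
    return magnitude if rng.random() < 0.5 else -magnitude

def simulate(trials=200_000, seed=12345):
    rng = random.Random(seed)
    sum_sq_median = 0.0
    sum_sq_linear = 0.0
    for _ in range(trials):
        x1, x2, x3 = sorted(laplace(rng) for _ in range(3))
        linear = (4 * x1 + 19 * x2 + 4 * x3) / 27
        sum_sq_median += x2 * x2
        sum_sq_linear += linear * linear
    # Both estimates are unbiased for the centre 0, so the mean square
    # about 0 estimates the variance.
    return sum_sq_median / trials, sum_sq_linear / trials

var_median, var_linear = simulate()
print(f"median: {var_median:.4f}   linear: {var_linear:.4f}")
```

Up to sampling error, the two printed values should fall near .64 and .59 respectively, with the linear estimate the smaller of the two.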

The moral seems to be, not that the median is a bad statistic, but that it
should not be used indiscriminately in the absence of any information about the form
of the population sampled. When it is used, probabilities relevant to the inferences
to be made should be calculated from its distribution, exact or approximate. These
calculations, particularly approximations for large samples, may be facilitated by
the results of [3].

We shall now prove that the mean square error in the sample median cannot
exceed that of a random observation, a proposition which may at first sight appear
self-evident.

Without loss of generality we shall assume the population median of the
variate x to be zero, and also that $E x^2 = 1$. With the notation m for the median
value of x in a sample of 2n + 1 independent values of x, our theorem will be proved
by showing that

$$E m^2 \le 1.$$

Denoting by F(x) the cumulative distribution function of the population, we
shall use the inverse of this function, x = x(F), making this inverse definite on
the open segment 0 < F < 1 by putting x(1/2) = 0 in accordance with the assumption of
zero population median, and otherwise assigning to x(F) the minimum value of x for
which F(x) takes a specified value. Then x(F) is non-decreasing.

The distribution of the median is well known to be given by

$$C \, [F(m)]^n \, [1 - F(m)]^n \, dF(m),$$

where $C = (2n+1)! \, (n!)^{-2}$. Let us put

$$g(F) = C \, F^n (1-F)^n.$$


We thus have

$$E m^2 = \int_0^1 [x(F)]^2 \, g(F) \, dF.$$

It is convenient to use the function

$$u(F) = [x(F)]^2 + [x(1-F)]^2 - 2.$$


The non-decreasing character of x(F) and the fact that $x(F) \le 0$ for $F < 1/2$
and $\ge 0$ for $F > 1/2$ make it clear that u(F) is non-increasing for $F < 1/2$ and
non-decreasing for $F > 1/2$. From the definition of g(F),

$$g(F) = g(1 - F), \qquad \int_0^1 g(F) \, dF = 1.$$
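
The two properties of g(F) just stated can be confirmed in exact rational arithmetic. This is a sketch only: the function names are hypothetical, and the integral is obtained by expanding (1-F)^n with the binomial theorem and integrating term by term.

```python
from fractions import Fraction
from math import comb, factorial

def g(F, n):
    """g(F) = C F^n (1-F)^n with C = (2n+1)! / (n!)^2."""
    c = Fraction(factorial(2 * n + 1), factorial(n) ** 2)
    return c * F**n * (1 - F) ** n

def integral_of_g(n):
    """Exact integral of g over [0, 1]: each term C(n,k) (-1)^k F^(n+k)
    of the binomial expansion integrates to C(n,k) (-1)^k / (n+k+1)."""
    c = Fraction(factorial(2 * n + 1), factorial(n) ** 2)
    return c * sum(Fraction((-1) ** k * comb(n, k), n + k + 1)
                   for k in range(n + 1))

for n in range(1, 6):
    F = Fraction(1, 7)
    print(n, g(F, n) == g(1 - F, n), integral_of_g(n))
```

Because g is symmetric about F = 1/2 and integrates to unity, its integral over 0 < F < 1/2 is 1/2, which is what turns the constant 2 in u(F) + 2 into the added 1 in the expression for the mean square of m.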


Since we have taken $E x^2$ to be unity,

$$1 = \int_0^1 [x(F)]^2 \, dF = \int_0^{1/2} \{ [x(F)]^2 + [x(1-F)]^2 \} \, dF = \int_0^{1/2} u(F) \, dF + 1,$$

so that

$$\int_0^{1/2} u(F) \, dF = 0.$$

From these relations we have

$$E m^2 = \int_0^{1/2} \{ [x(F)]^2 + [x(1-F)]^2 \} \, g(F) \, dF = \int_0^{1/2} \{ u(F) + 2 \} \, g(F) \, dF = \int_0^{1/2} u(F) \, g(F) \, dF + 1.$$
Since u(F) equals -2 for F = 1/2 and is non-increasing on 0 < F < 1/2, and since
its integral is zero, there exists a number $F_0$, $0 < F_0 < 1/2$, such that
$u(F) \ge 0$ for $0 < F < F_0$ and $u(F) \le 0$ for $F_0 < F < 1/2$. Furthermore,
since g(F) is an increasing function on 0 < F < 1/2, its values are less than
$g(F_0)$ for $F < F_0$ and greater than $g(F_0)$ for $F_0 < F < 1/2$. Consequently,

$$\int_0^{F_0} u(F) \, g(F) \, dF \le g(F_0) \int_0^{F_0} u(F) \, dF,$$

and

$$\int_{F_0}^{1/2} |u(F)| \, g(F) \, dF \ge g(F_0) \int_{F_0}^{1/2} |u(F)| \, dF.$$


In the last integral $u(F) \le 0$, so that the inequality is reversed when the absolute
value signs are removed. Combining with the preceding inequality gives

$$\int_0^{1/2} u(F) \, g(F) \, dF \le g(F_0) \int_0^{1/2} u(F) \, dF = 0.$$

Therefore

$$E m^2 \le 1,$$

as was to be proved.
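
The theorem can be checked numerically for particular populations. The sketch below is an illustration, not part of the proof: it evaluates the mean square of m as the integral of [x(F)]^2 g(F) over 0 < F < 1, and divides by the integral of [x(F)]^2, so that no prior scaling to unit mean square is needed. The three inverse distribution functions (two-point, uniform, and exponential shifted to zero median), the function name and the step count are all hypothetical choices.

```python
import math

def median_mse_ratio(quantile, n, steps=20_000):
    """Ratio E[m^2] / E[x^2] for the median m of 2n+1 observations, where
    quantile(F) is the inverse distribution function x(F), shifted so that
    the population median is zero. Both integrals over 0 < F < 1 are
    approximated by the midpoint rule."""
    c = math.factorial(2 * n + 1) / math.factorial(n) ** 2
    h = 1.0 / steps
    num = den = 0.0
    for k in range(steps):
        F = (k + 0.5) * h
        x_sq = quantile(F) ** 2
        num += x_sq * c * F**n * (1 - F) ** n * h
        den += x_sq * h
    return num / den

populations = {
    "two-point": lambda F: -1.0 if F < 0.5 else 1.0,
    "uniform": lambda F: F - 0.5,
    "exponential": lambda F: -math.log(1 - F) - math.log(2),
}
for name, q in populations.items():
    print(name, [round(median_mse_ratio(q, n), 4) for n in (1, 2, 5)])
```

The two-point population gives a ratio of one at every sample size, the extreme case; the other two give ratios below one that shrink as n increases.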